AIbase
Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
Multimodal Document QA

# Multimodal Document QA

Glm 4vq
4-bit quantized version of GLM-4V-9B, supporting multimodal multilingual understanding with memory usage under 9G, outperforming multiple mainstream models
Image-to-Text Transformers Supports Multiple Languages
G
nikravan
440
33
Layoutlm Invoices
A multimodal LayoutLM model fine-tuned for invoice and other document QA tasks, supporting discontinuous text recognition
Image-to-Text English
L
aslessor
16
2
Layoutlm Invoices
A document QA model fine-tuned based on the LayoutLM architecture, specifically designed for processing discontinuous text recognition in invoices and other documents
Text-to-Image Transformers English
L
magorshunov
145
57
Layoutlm Invoices
A document QA model fine-tuned based on the LayoutLM architecture, specifically designed for handling invoice and other document QA tasks
Text-to-Image Transformers English
L
impira
75.42k
198
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
English简体中文繁體中文にほんご
© 2025AIbase